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Abstract 

The opportunistic beamforming in tiie downlink of multiple-input single-output (MISO) systems forms N transmit 
beams, usually, no more than the number of transmit antennas A^t. However, the degrees of freedom in this downlink 
is as large as A'^^'^. That is, at most rather than only A'^t users can be simultaneously transmitted and thus the 
scheduling latency can be significantly reduced. In this paper, we focus on the opportunistic beamforming schemes 
with Nt < N < Nf transmit beams in the downlink of MISO systems over Rayleigh fading channels. We first 
show how to design the beamforming matrices with maximum number of transmit beams as well as least correlation 
between any pair of them as possible, through Fourier, Grassmannian, and mutually unbiased bases (MUB) based 
constructions in practice. Then, we analyze their system throughput by exploiting the asymptotic theory of extreme 
order statistics. Finally, our simulation results show the Grassmannian-based beamforming achieves the maximum 
throughput in all cases with Nt = 2, 3, 4. However, if we want to exploit overall Nt degrees of freedom, we shall 
resort to the Fourier and MUB-based constructions in the cases with A'^t = 3, 4, respectively. 
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I. Introduction 

Multiple-input multiple-output (MIMO) system holds promise for the next generation wireless communications 
due to its high spectral efficiency [1], [2]. In a single-user MIMO system, its capacity has been extensively 
investigated, assuming different channel state information (CSI) is known at the transmitter and/or receiver [3], 
[4]. In the multi-user scenario, the multi-user diversity was introduced as a new dimension of degrees of freedom 
to further increase the capacity [5]-[8]. In this paper, we focus on the downlink of multi-user MIMO systems, i.e., 
broadcast channels (BCs). By using dirty paper coding strategy at the transmitter [9], the optimal sum-rate capacity 
region of MIMO BCs was well established from the information-theoretic viewpoint, with the assumption that CSI 
is perfectly known at the transmitter and all the receivers [10], [11]. This region can be numerically evaluated by 
using the duality between BCs and multi-access channels (MACs), though it is extremely computationally intensive 
[12]-[14]. In practice, the optimal sum-rate capacity region of MIMO BCs can be approached using the nested 
lattices or trellis beamforming scheme [15], [16], which generalizes the Tomlinson-Harashima beamforming [17], 
[18]. Unfortunately, perfect CSI at the transmitter is almost infeasible in practical communication systems with large 
number of users, and also the non-linear beamforming is usually impractical for the real-time traffic. Therefore, 
designing Unear beamforming schemes with lower feedback complexity is of great interest [19]-[21]. 

The opportunistic beamforming system (OBS), also known as the random beamforming system, is shown in [22] 
to achieve the maximum sum-rate capacity with the minimum amount of feedback, provided that the number of 
users is not smaller than the number of transmit antennas. This condition is surely satisfied in the practical cellular 
systems. The single-beam OBS is proposed in [23], in which the conceptual idea of multi-beam OBS is also 
presented in [23, Appendix B]. The detailed analysis on the throug hpufl of OBS with multiple orthogonal transmit 
beams is performed in [24], [25]. Moreover, the opportunistic beamforming with only signal-to-interference-plus- 
noise ratio (SINR) feedback is generalized in [26] to the case with composite feedback consisting of quantized 
channel directional information and channel quality information (channel magnitude or SINR). 

In the literature with respect to multi-beam OBS [24]-[29], random vector quantization (RVQ) limited feedback 
MIMO systems [26], [30]-[32], or the 3GPP Long Time Evolution (LTE) of 3G systems [33], it is always assumed 
that the number of transmit beams N is identical to the number of transmit antennas Nt and thus there are at most 
Nt users can be simultaneously transmitted. In other words, the beamforming matrix is a square matrix with size 
Nt X Nt- However, it is shown that the optimal transmission strategy regarding the sum-capacity criterion in MIMO 
broadcast channels with large number of users involves more than Nt transmit beams at the same time but upper 
bounded by N^, i.e., Nt < N < N^ [34]. If each user is equipped with A'^ > 1 receive antennas, he/she can receive 
up to N^ data streams [34]. This allows the user to increase his/her own data rate but it prevents the simultaneous 
transmission by other users, so that the number of simultaneously transmitted users is limited to be \Nt/N^~\, 
where we assume that each user receives exactly N^ data streams and \x~\ denotes the integer ceiling operator [34]. 

'in this paper, the term "throughput" refers to the average sum rate capacity, and the link-adaptive techniques, such as adaptive 
coding/modulation, finite constellation and dynamic power allocation, are not taken into account. 



SUBMITTED TO IEEE TRANS. INF. THEORY, SEPT. 2, 2008; REVISED ON JAN. 5, 2009 



2 



In other words, there are at most degrees of freedom in the extreme case with Nr = 1- Throughout this paper, 
we suppose there is only N,. = 1 receive antenna for each user and thus there are at most Nf users that can be 
simuhaneously transmitted. The beamforming matrix is now oblong with size Nt x N^. Unfortunately, [34] does 
not show us the implementation of beamforming schemes with Nt < N < Nf simultaneously transmitted users. To 
the best of authors' knowledge, only the case with N = Nt + 1 scheduled users is addressed in [35] by exploiting 
the tight Grassmannian frames. 

In this paper, we show how to schedule Nt < N < N^ users simultaneously. Specifically, we design the 
opportunistic beamforming schemes with Nt < N < Nt transmit beams in which one user is scheduled at 
each beam. In particular, Nt is supposed no larger than 4, just as that in 3GPP LTE [33]. Unlike the orthogonal 
transmitting case [24], the orthogonality between different transmit beams is not retained again if > A'^j. More 
precisely, the rank of beamforming matrix B G c^tx^ jg certainly no larger than Nt, where C stands for the field 
of complex numbers. That is, there are at least N — Nt transmit beams that are no longer be orthogonal with the 
others. However, if the transmit beams are generated as at least correlated as possible, more transmit beams benefit 
to schedule more users as soon as possible and hence decrease the scheduling latency. Unfortunately, the increased 
multi-user interferences and the loss of orthogonality between transmit beams will inevitably deteriorate system 
throughput. Therefore, there must be a tradeoff between more and more transmit beams and increased multi-user 
interferences as well as disappearing orthogonality. In this paper, we first show how to construct the beamforming 
matrices and then the system throughput is rigorously investigated. 

The rest of this paper is organized as follows. We present the system model and scheduling strategy in Section 
Hn In Section |llll we show how to design the beamforming matrices with constrained correlation property. Then, 
the system throughput is analyzed in Section HV] Simulation results and discussion are presented in Section |V] and 
finally. Section |VI] concludes the paper. 

II. System Model And Scheduling Strategy 

A. System Model 

In this paper, we consider the downlink of a homogeneous single-cell cellular system where the base station with 
Nt antennas transmits packets to K single-antenna users, that is, the number of receive antennas Nr = 1 for each 
user. The number of users K is assumed no less than N^ and all users are scattered geographically and do not 
cooperate]^ moreover, their average SNRs are identical. Block flat Rayleigh fading channels are supposed and all 
the time indices are omitted for the sake of notation brevity if no other specific statement; furthermore, different 
channels among users are mutually independent. In addition, we suppose that the transmission time is divided into 
consecutive and equal time slots, and each time slot is less than the possible time delay but long enough so that 
there is a coding strategy available that operates closely to Shannon channel capacity. Moreover, each time slot is 

^The case in which the number of users and the number of transmit antennas are of the same order is addressed in [30], [36] and the references 
therein. 
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Fig. 1. The block diagram of OBS with multiple transmit beams 



divided into a number of equal sized mini-slots and several initial ones are used to transmit common pilot symbols, 
so that the base station can determine which users shall be chosen for data transmission in the rest mini-slots 
according to the feedback of each user. 

The OBS with N transmit beams is illustrated in Fig.[T] At the base station, N different beams are simultaneously 
transmitted during one time slot, where N ^ [1, Nf]. When iV = 1, it denotes the single-beam transmission [23]. 
When N = Nt, it refers to the conventional multi-beam orthogonal transmission [24]-[26]. In this paper, we 
concentrate on the cases with Nt < N < N^. 

When pilot symbols are transmitted during the first several mini-slots ]^ S C comprises 

N different elements simultaneously transmitted at N different beams. Furthermore, x^^, n — 1, ■ ■ ■ , N is simulta- 
neously sent out from Nt transmit antennas with each being multiplied by a beamforming coefficient .^ya~"e^^' " 
at Antenna i, 1 < i < Nt and Beam n, 1 < n < N. That is, N different pilot symbols are needed to distinguish 
N transmit beams. On the other hand, when data is transmitted during the later mini-slots, refers to user data 
transmitted at Beam n. Then, the received symbol of User k, G C^^^, is given by 

— N 

n=l 

where p is the average received SNR for each userjj e C^^^ stands for additive white Gaussian noise with zero 
mean and unit variance; e i^ixNt denotes the complex channel vector between User k and the base station, 

'it is shown that on an average 2.5 mini-slots is required to find the scheduled users [37]. 

'^In order to make a fair comparison between different transmit schemes, the transmit power in our proposal is normalized such that it is 
independent of the number of transmit beams TV as shown in (T), whereas in the conventional orthogonal opportunistic beamforming systems 
the transmit power is assumed to be identical with [24, Footnote 3]. 
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and it agrees with Rayleigh fading with zero mean and variance 1/m. Moreover, the instantaneous beamforming 
matrix B{t) e C^'^^ at time slot t can be written as 



where 



b = 



(2) 



n = l, •••,7V (3) 



is the beamforming vector at Beam n, and (.)^ denotes the transpose operator. 

In the case with N — Nt, the amplitudes a. ^, i — 1, Nt in ^ are uniformly distributed over [0, 1) such 
that X^t^'i Q^i n — 1' the phases 9.^,i — l,---,Nt are independent and uniformly distributed over [0, 2tt). 
Moreover, different beamforming vectors are orthogonal with each other, that is, 

„ \ I, I = n 

bfb^ = { l,n^l,---,Nt (4) 

[ 0, Ij^n 

where (.)^ denotes the Hermitian transpose operator Now, B{t) is a unitary matrix and it can be generated according 
to an isotropic distribution. However, in the cases with Nt < N < N^, the beamforming vectors b^, n ^ 1, ■ ■ ■ , N 
are no longer orthogonal with each other 

B. Scheduling Strategy 

As far as the scheduling strategy is concerned, we assume that each user sends them back to the base station, 
his/her maximum received SINR and its corresponding beam index among N different beams|f| The feedback 
beam-index e [1, N] of User k is determined by 

"fc=arg max (5) 

n— 1, ■■■ , A* 

where |a;| denotes the amplitude of x. 

According to ([T]i, the received SINR of User k at Beam n is 

7„.. = ^^1^^ (6) 



1 + ^ E \hA? 



N 

Z=l, l^n 



Hence, his/her maximum SINR among N beams is 



7^ = max 7^ ^ (7) 
■■■ ,N ' 

Combining (|5]l and (O, the feedback information of User k can be shown as (fij.,7j,). At the base station, 
there are N different data sets 5„, where n £ [1, TV], corresponding to N different beams to store the feedback 
information of all users. That is, for any feedback (^^,7^), if n,, = n, then 7^. e 5„. Moreover, the maximum 



^Actually, it is not necessary for each user to offer liis/her feedback information. Instead, the same system throughput can be nearly acliieved 
by only allowing the strongest 10% users to provide feedback, whose received SINRs are above a predefined threshold level. This is the so-called 
selective multi-user diversity beneficial to greatly decrease the feedback complexity [19]. 
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SINR scheduling strategy is adopted at the base station to choose a user for transmission at each beam. Therefore, 
the index of scheduled user at Beam n is 

k = arg max 7^ , n = 1, • • • , iV (8) 

Finally, the maximum SINR directing the transmission at Beam n is 7- . 

III. The Implementation of OBS With Nt < N < Nf 

In this section, we show three practical beamforming schemes with Nt < N < Nf. In general, the instantaneous 
beamforming matrix B{t) shown in ^ is constructed with a fixed initial matrix B E £,NtxN ^^^^ ^ time-variable 
vector 



(9) 



in which On, n ~ 1, • ■ ■ 1 ^ fixed in time slot t but varied from time slot t to t + 1, furthermore, they are 
independent and uniformly distributed over [0, 27r). More accurately, B{t) is generated as 



B{t) = 



b, 6, • • • 6, 



(10) 
(11) 



= e^^^B{:,l) ei<^^B(:,2) ■■■ e^""B(:,iV) 

where B{: , n) refers to the n*'* column of B. Equation (fTTT i implies only a phase rotation is performed on each 
column of B to get B{t). Therefore, the correlation property of B is remained. 

From a purely information-theoretic point of view, using the deterministic initial beamforming matrix B yields 
the same system throughput as that if the time-variable B{t) shown in (fTTT i is applied in fast fading environment. 
The artificial randomness introduced by c{t) shown in (|9]l is to ensure fairness between in fast and slow fading 
environments [23]. The introduction of c{t) changes neither the correlation property between any pair of columns 
of B nor the distribution function of the received SINR. 

In what follows, the key point is how to design B e C^* ^ ^ to accommodate more users (larger N) and maximize 
system throughput. A straightforward idea is first to generate a unitary matrix with size N x N, and then choose 
its first Nt rows. Despite its simplicity, the main drawback of this construction is the correlation between different 
beamforming vectors 6^^ , n = 1 , • • • iV is not guaranteed at all. Moreover, the transmit power 
at different beams is randomized. 

Now, we present three different methods to construct B with constrained correlation property. 



A. Fourier-Based Construction 

It is well known that a Fourier matrix F £ C^' is an orthogonal basis in a iV^^ -dimensional complex space. 
Its projection into a iVi -dimensional complex space forms a tight frame whose elements have the broadest scattering, 
and this projection simply retains the first Nt rows of F [38]. Inspired by this observation, we propose to set the 
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TABLE I 

The comparison of minimum maximum cross-correlation of the Fourier, Grassmannian, and MUB-based 

CONSTRUCTIONS. THE GRASSMANNIAN-BASED ONE HAS THE BEST PERFORMANCE IF Af = 4, 7, 13. ALTHOUGH ONLY FOURIER-BASED 
ONE FUNCTIONS IF Af = 9, THE MUB-BASED ONE OUTPERFORMS IT IF Af = 16. 





Af 


<5o 


# selected rows 






KB,,) 


Lower bound 


52 


2 


4 


0.7071 


{2. 3} 


0.7071 


0.5774 


0.7071 


0.5774 


1 


3 


7 


0.7490 


{1.2, 4} 


0.4714 


0.4714 


\ 


0.4714 


1.3333 


3 


9 


0.8440 


{3, 7, 9} 


0.6565 


\ 


\ 


0.5 


2 


4 


13 


0.8597 


{1, 3, 4, 8} 


0.4330 


0.4330 


\ 


0.4330 


2.2499 


4 


16 


0.9061 


{1, 10, 12, 13} 


0.5817 


\ 


0.5 


0.4472 


3 



initial beamforming matrix as, where the subscript F refers to the Fourier-based beamforming, 



1 
1 

1 w 



1 

w 

Nt-1 



1 



Af — 1 



,,{Nt-l){N?-l) 



(12) 



in which w = e i^'^/^t _ 

For this choice, the correlation between transmit Beams I and n is 



1, 



1 



sin {n{l-n)/Nt) 
s\n(TT{l-n)/N'i) 



I = n 
I ^ n 



(13) 
(14) 



Roughly speaking, ( fT4b suggests the correlation of Bp behaves like a sine function and hence all the cross- 
correlations between a specific beam and the others are smaller than its auto-correlation. Therefore, different users' 
channels can be well matched by different beamforming vectors. 

However, it is not necessarily constrained to choose the first Nt rows, but instead the maximum cross-correlation 
between different beams can be further lowered by appropriately choosing another set of Nt components. Unfor- 
tunately, the optimal choice with the lowest maximum cross-correlation 



mm max c, 

1^71 • 



(15) 



requires exhaustive searching [38]. In Table U we list the number of selected rows with the minimum d{Bp) as 
shown in ( fTSl ). where Sq stands for the maximum cross-correlation with the fist Nt rows. For example, when A^t = 3 
and N = 9, shown in ( fT4b is plotted in Fig. |2] We observe that, with the best choice of {3, 7, 9} rows, the 
maximum cross-correlation is decreased from 0.8440 to 0.6565. 
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1.4r 



1.4r 



1.2 



1.2 



0.8440 



% O.l 



9 S 



0.6565 



i? 0.6 ■ 



0.4 ■ 



0.2 



i5 0.6 ■ 



0.4 ■ 



0.2 



() O 

9 9 



10 



10 



Fig. 2. The correlation coefficient c^ ^ sliown in 1141 of Fourier-based constructions with Nt = 3 and TV = 9, as a function of |i — n|. The 
left-hand panel coiTesponds to the beamforming matrix composed of the first three rows of Fouiier matrix with size 9x9. The light-hand panel 
refers to our optimal beamforming matrix with selected {3, 7, 9} rows. Obviously, the maximum cross-correlation is decreased from 0.8440 
to 0.6565. 



B. Grassmannian-Based Construction 

Our intention to find B E £^NtxN ^^^^ minimum maximum cross-correlation between any pair of N 

beamforming vectors, is equivalent to the Grassmannian line packing problem in the space C^*, which is to find 
a set of N lines that the minimum distance between any pair of lines is as large as possible [39]. Although the 
Grassmannian packing methodology has already been widely applied in the codebook design [40]-[43], it has 
seldom been employed in the design of opportunistic beamforming. To the best of authors' knowledge, only the 
tight Grassmannian frames are exploited to construct the beamforming matrix in the case with N ~ Nt + 1 [35]. In 
this subsection, however, we focus on the generalized cases with Nt < N < Nj, making the connections between 
Grassmannian frames and opportunistic beamforming design more transparent^ 

The Grassmannian frame {b^}, n = ,N minimizes the maximum correlation between frame elements 

among all unit norm frames which have the same redundancy defined by 

N 

Furthermore, if E C^' , n = 1, ■ ■ ■ , N, then the maximum frame correlation is lower bounded by [39, Theorem 

'Note please that the number of lines A'^ is of no any constraint in the separable infinite-dimensional Hilbert space for the Grassmannian line 
packing problem [39]. But in our opportunistic beamforming design, A'^ < is imposed because of the limited degrees of freedom in the 
downlink of MISO systems [34]. 
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2.3] 



6 = min max 16^6 I > 



' N-Nt 
Nt{N- 1) 



(17) 



Moreover, the equality in ( fTTj i can only hold if < N^, and also now {6^}, n = 1, • • • , is an equiangular 
tight frame. One achieving the equality in ( fTTl ) is called optimal Grassmannian frame. Unfortunately, although there 
are at most degrees of freedom in the downlink of MISO systems, the optimal Grassmannian frame does not 
always exist for any choices of Nt and N. For example, if Nt — 3, there are at most N = 7 frame elements for the 
optimal Grassmannian frame. In what follows, we give the initial beamforming matrix with maximum redundancy 
?/, that is, the number of transmit beams N is maximized while achieving the equality in ( fTTl ). according to the 
following Lemma [T] 

Lemma 1: (Konig [44]) Let p be a prime number and I over the field N of positive numbers, we set Nt = + 1 
and N ^ Nt - Nt + 1. Then there exist integers < di < ■ ■ ■ < d^^ < N such that all numbers 1, • • • , - 1 
occur as residues mod N of the Nt{Nt — 1) differences di — dq, i ^ q, 1 < i, q < Nt- For n = 1, • • • , A^, we 
define 



b 



1 



^j2TTndi/N j2-nnd2/N 



„j27rndjvt /N 



(18) 



and then the vectors 6„, n = 1, - • • , A^ form a harmonic optimal Grassmannian frame with maximum frame 
correlation ^/Nt ~l/Nt. 

1) Nt — 2, N — 4: In this case, according to [40, Table 11], the initial beamforming matrix where the 
subscript G denotes the Grassmannian-based beamforming, can be given by 

-0.1612 -0.7348j -0.0787 - 0.3192j -0.2399 + 0.5985j -0.9541 
-0.5135 -0.4128j -0.2506 + 0.9106j -0.7641 - 0.0212j 0.2996 
We can easily verify that the columns of form an equiangular unit norm frame. Furthermore, the equality 
of lower bound in ( fTTb is attained with S{B^) = 0.5774. Therefore, the columns of i?^ in (fT9] l make an optimal 



(19) 



Grassmannian frame. 

2) Nt =^ 3, N = 7: We get di 
them into ( fTSl ), we have 



0, d2 — 1, and d^ ^ 5 through exhaustive searching, and then substituting 



0.5774 


0.3600 4 


-0.4514j 


-0.1285 


- 0.5629j 


0.5774 


-0.1285 


+ 0.5629j 


-0.5202 


+ 0.2505j 


0.5774 


-0.5202 


+ 0.2505j 


0.3600 4 


- 0.4514j 


0.5774 


-0.5202 


- 0.2505j 


0.3600- 


^ 0.4514j 


0.5774 


-0.1285 


- 0.5629j 


-0.5202 


- 0.2505i 


0.5774 


0.3600 - 


- 0.4514j 


-0.1285 


+ 0.5629i 


0.5774 


0.5774 


0.5774 



(20) 



which achieves the equality in (fTTl l with S{B^) = 0.4714. 
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I, — 3, and = 9 through exhaustive searching, and then 



(21) 



3) Nt = A, N = 13; We show that di = 0, da 
substituting them into (fTSl l. we get 

0.5 
0.5 
0.5 
0.5 
0.5 
0.5 
^ 0.5 
0.5 
0.5 
0.5 
0.5 
0.5 

0.5 0.5 0.5 0.5 

which achieves the equahty in ( fTTI i with S{B^) = 0.4330. 

Remark 1: With the Fourier-based construction, we minimize the maximum cross-correlation between different 
transmit beams. On the other hand, different transmit beams are forced to be equiangular with the Grassmannian- 
based construction, and also they have the maximum distance between any pair of beams. However, we claimed in 
Section U that there are at least N — Nt transmit beams that are no longer orthogonal with the others. Therefore, 
a natural question to ask is: Can we design a beamforming matrix B e (jNtxN ^^^^ jy^ orthogonal vectors while 
simultaneously they have the same cross-correlation with the rest N — Nt vectors? The answer is yes, but we have 
to rely on the concept of mutually unbiased bases (MUB) elaborated in the next subsection. 



0.4427 + 


0.2324j 


0.0603 -t 


- 0.4964j 


-0.1773 


- 0.4675j 


0.2840 + 


0.4115 j 


-0.4855 


-h0.1197j 


-0.3743- 


f 0.3316j 


0.0603 + 


0.4964j 


-0.1773 


- 0.4675j 


0.4427 4 


0.2324j 


-0.1773- 


f 0.4675j 


0.4427 - 


- 0.2324j 


0.0603 - 


0.4964j 


-0.3743- 


F0.3316j 


0.2840-1 


-0.4115j 


-0.4855- 


f 0.1197j 


-0.4855- 


F0.1197j 


-0.3743 


-l-0.3316j 


0.2840 4 


0.4115j 


-0.4855- 


- 0.1197j 


-0.3743 


- 0.3316j 


0.2840- 


0.4115j 


-0.3743- 


- 0.3316j 


-0.2840 


- 0.4115j 


-0.4855 


-0.1197j 


-0.1773- 


- 0.4675j 


0.4427 -t 


- 0.2324j 


0.0603 4 


0.4964j 


0.0603 - 


0.4964j 


-0.1773 


+ 0.4675i 


0.4427- 


0.2324j 


0.2840- 


0.4115j 


-0.4855 


- 0.1197i 


-0.3743 


- 0.3316j 


0.4427- 


0.2324j 


0.0603-^ 


- 0.4964j 


-0.1773- 


f 0.4675j 



C. MUB-Based Construction 

Let U = , • • • , } and V = {v-^ , } be orthonormal bases of C^', U and V are mutually unbiased 
if the cross-correlation of vectors satisfies 

\ufvj^^, l<l,n<Nt (22) 

Furthermore, the set B = {Ui, ■ ■ ■ ,Us} is known as an MUB. It is reported in [45] that B can be constructed 
according to the following Lemma |2] 

Lemma 2: (Gow [45]) Let Nt be a power of 2 and let X consisting of unitary matrices be an irreducible complex 
representation of of degree Nt, where denotes a finite group of order Nf". Let D he a Nt x Nt matrix that 
satisfies D^'+i = J and D-^X{x)D = X {S{x)) for all x in G„^. Then the powers D, D^, , = I 

define Nt + 1 pairwise mutually unbiased bases. Furthermore, all entries of D are in the field (Q)(\/— !)■ 
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Based on Lemma |2] our initial beamforming matrix B^^^ in which the subscript M denotes the MUB-based 
beamforming, can be given by 



D 



D 



Nt 



(23) 



Note that £)^'+i = / corresponding to the case of transmit antenna selection is abandoned, due to the limitation 
of Nf degrees of freedom in the downlink of MISO systems. Unfortunately, for the cases under consideration with 
Nt < 4, the powerful Lemma|2]can only be exploited in the cases with Nt — 2, 4, rather than the case with Nt — 3. 

1) Nt^2, N = A: In this case, D is given by [45] 



D 



-1 j 
1 j 



Substituting it into ( |23] l. we have 



1+j 



-1 J 
1 J 



-1 



(24) 



(25) 



2j A^f = 4, = 16.- Based on [46], we can easily show that D can be given by 













~3 


-3 


-3 


-j 




















1 


1 


-1 


1 


-1 


















D 


~ 2 






























~3 


-i 


j 


3 






















-1 


1 


1 


-1 












it into (l23Tl yields 


























-.7 


-3 -3 


-3 -1 


-1 


-3 


3 


-1 




3 


1 


3 


1 3 


-1 


1 


1 


-1 1 


-1 -3 


-j 


-1 


1 


-1 




-j 


-1 


3 


-1 3 


1 


2 


-J 


~3 j 


3 ~3 


3 


-1 


-1 


3 


-1 


-1 


-j 


3 


1 -J 


1 




1 


1 1 


-1 1 


-1 


3 


i 


-3 


1 


-1 


-3 


3 


-1 -3 


-1 



(26) 



(27) 



In Table U the minimum maximum cross-correlations 6{B^) and S{Bj^j) of Grassmannian and MUB-based 
constructions, respectively, as well as the lower bound shown in (fTTI i are also listed, with respect to different 
number of transmit antennas Nt and number of transmit beams N. 

IV. Asymptotic Throughput Analysis 

In this section, we investigate the system throughput and thus give definite answer to which kind of beamforming 
scheme is most preferable for a specific {Nt, N) configuration, among Fourier, Grassmannian and MUB-based 
constructions. 



A. Received SINK of User k 

For any two non-orthogonal beamforming vectors 6, and where I n, can be expressed in reference to 
b^ through their cross-correlation coefficient (5, , that is, 

6, ^ 5fi^ + Jl- 5^b^, l<l,n<N (28) 
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where stands for the orthonormal vector to . Substituting (l28T l into (|6]l, the received SINR of User k at Beam 
n can be rewritten as 

7„,. = ^ ^^-^ (29) 

1 + ^ E \s,Kb^ + ^/T^Kb^\^ 

1 = 1, l^n 

^' " (30) 



l + £^S^\h,bJ^ 

where we explored the approximation h^b^ w in dSOl l. with the assumption that the beamforming vector b^ 
matches perfectly with the channel when the number of active users is large enough. The parameter (5^ is a 
constant determined by the correlation structure of the beamforming matrix, which can be calculated respectively 
as follows. 

1) Nt = 2, N = A: In this case, we observe from Table U that the Grassmannian-based beamforming is better 
than Fourier or MUB-based construction, since its minimum maximum cross-correlation 5{B^) achieves the lower 
bound 0.5774. Furthermore, the Grassmannian-based beamforming matrix in il9[ is equiangular, so that 

P = 3x 0.57742 ^ ^ 

2) Nt = 3, N = 7: In this case, it is observed from Table U that the beamforming matrix with Grassmannian- 
based construction has the same performance as that of the Fourier-based one with selected {1, 2, 4} rows. They 
both achieve the low bound 0.4714 and thus 

^2 = 6 X 0.47142 = 1.3333 (32) 

3) Nt — 3, N = 9: We find from Table U that only the Fourier-based construction functions in this case, though 
S{Bp) = 0.6565 is larger than the lower bound 0.5. From the right-hand panel of Fig.|2] we have 

(5^ = 0.2280^ + 0.4285^ + 0.5774^ + 0.6565^ + 0.6565^ + 0.5774^ + 0.4285^ + 0.2280^ = 2 (33) 

4) Nt — A, N — 13.- In this case, it is shown in Table |T] that the beamforming matrix with Grassmannian-based 
construction has the same performance as that of the Fourier-based one with selected {1, 3, 4, 8} rows and the 
lower bound 0.4330 is achieved, hence 

52 = 12 X 0.4330^ = 2.2499 (34) 

5) Nt — A, N — 16.- In this case, we observe from TableUthat the MUB-based beamforming matrix outperforms 
the Fourier-based one, though they both don't arrive at the lower bound 0.4472 but 5{B^j) = 0.5 of the former is 
much closer to it than S{Bp) = 0.5817 of the latter Therefore, 

P = l2x 0.52 = 3 (35) 

All the above values of (5^ are also listed in the last column of Table U 
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B. Asymptotic Distribution of N Maximum Received SINRs 

At the base station, we arrange the K received SINRs in Set 5„ corresponding to Beam n as 7^ , • • • ,7^ in an 
ascending orderj^ where k — 1, ■ ■ ■ , K has the same meaning as 7^ ^ in ^ but the beam index n is ignored 
here for the sake of notation brevity. Then, we turn to find the limiting distribution of the N upper extremes of 
order statistics 7i, • • • ,7^, by applying the asymptotic theory of extreme order statistics. 

It is straightforward to show that z = |/i^6^J^ in (|30] | is of the chi-square distribution with two degrees of 
freedom, that is, its PDF can be given by 



/^(z) = mexp(— 7712:), z>0 
Thus, after some manipulations, the PDF and CDF of 7^ in dSOl l can be shown respectively as. 



/r.(7) 



mN 



■ exp 



and 



Fj, (7) 1 - cxp 



p(l - 5^) 



7 < 



1 

7 < — 

S2 



1 



(36) 



(37) 



(38) 



Resorting to the well-known von Mises's sufficient conditions in the asymptotic theory of extreme order statistics 
[47], [48], we substitute ( |37| | and (l38l l into the growth function defined by 

l-^^r.(7) 



7(7) 



/r.(7) 



(39) 



and then it is straightforward to show the limit of the derivative of g{j) is, as 7 ^ 1/(5^, 

lim Ml)^o 

7^1/52 d7 

Therefore, (7) is in the domain of attraction of Gumbel-type limiting distribution g (7), where [47, p. 296] 



(40) 



-^^3,0 (7) = exp(-e 

That is, the limiting CDF of the maximum received SINR 7^, over 7^ , • • • , 7^^, is 



lim F (7) = lim ^;,(7) 



K 



lim 



1 — exp 

7 — 



K 



in which we used ( |38] | in ( l43b . Moreover, the position parameter a is the solution to [48, Theroem 2.1.3] 

1 



1 - F,^^ (a) = 



K 



(41) 

(42) 
(43) 
(44) 

(45) 



'There are at most K SINR values in Set <S„ and meanwhile the other TV — 1 sets are all empty, which means all users simultaneously have 
their maximum received SINRs at Beam n. 
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Fig. 3. The inefficiency of Gumbel-type limiting distribution. 



Substituting (l38]l into (|45]l yields 



p\nK 



niN + p6'^ In K 

On the Other hand, the scale factor b can be obtained as [48, Remark 2.7.1] 

b = g{a) 

1 - ^r, (a) 

pmN 



Moreover, based on [48, Theorem 2.8.1], their respective limiting CDFs of N upper extremes of 7^, 
that is, 7^, 7k_^, • • • , 7if-K+i' can be given by, as K ^ +00, 

(a + &"/~) = exD f— e — 



^r,._+, (« + &7) = exp i-e--') J2 



1=0 



n ' 



n=l,--- ,N 



Finally, it is straightforward to show that their limiting PDFs are, respectively, as K 



exp (— e '''), ?T.= l,---,iV 



(46) 

(47) 
(48) 

(49) 

(50) 
(51) 



where r(.) refers to the Gamma function. 

It is well known that the KuUback-Leibler distance 'D{f\\g) which behaves like the square of the Euclidean 
distance [49, p. 299] is a measure of the inefficiency of an approximate distribution g to its true distribution /. 
Thus, we may exploit it to check the inefficiency of the Gumbel-type hmiting distribution ( |44] |. Specifically, in our 
numerical evaluation, we compare the true PDF 



K-l 



(52) 
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with its limiting PDF 

of the maximum received SINR 7^^, and the Kullback-Leibler distance is defined as [49, p. 231] 

+00 

VifWg)^ J f{j)\og^M (54) 


In Fig. |3] we show the numerical results of ( |54| | with p — OdB, m ~ 0.5 and 3. We observe that the Kullback- 
Leibler distance is only 0.14 bits if K — 8 and m — 0.5, and it decreases as increasing m. For example, it is about 
0.025 bits if K = 8 and m ~ 3. Furthermore, it further decreases as increasing K and finally it approaches zero 
as K > 23. Therefore, the Gumbel-type limiting distribution (l44l) is a good approximation to its true distribution 
of the maximum received SINR. 



C. Throughput Analysis 

We suppose that all N scheduled users have simultaneously the maximum SINR at N different transmit beams, 
then the system throughput is upper bounded by 

Ru < iV£;{log2(l + 7K)} (55) 

N f — - 

= J J log2(l+7)e'^exp(-eT)d7 (56) 



where 7 = — (7 — a) /b. 

On the other hand, if N scheduled users always have different SINR at N different transmit beams, that is, if we 
ignore the small probability that at least two scheduled users obtain the same SINR, then the system throughput is 
lower bounded by 

Ri > eI J2 log2(l+7j| (57) 

(58) 



\ J log2(l + 7) (E^) exp(-e-)d7 



= \T.^) j (1 + 7)e"^ exp (-e^ d7 



(59) 



- r(n) 

in which we used ( BTT i in (1581 ). Unfortunately, Ru and Ri above can only be calculated by numerical integration. 

Actually, when the number of users is large enough, all N scheduled users have almost the same SINR at N 
different transmit beams and hence the system throughput approaches the upper bound. Therefore, the upper bound 
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Ru shown in ( fSST l can be analytically reformulated as 



R: 



'U 



< 



7Vi?{log2(l+7..)} 



(60) 



< 



N\og,{l + E{jJ) 



(61) 



N log2 (1 + a + bT) 



(62) 



N\og2 1 + 



pmN(T + Ini^) + ^^^2(1^^)2 
{mN + pP In A') 2 



(63) 



where we used the Jensen's inequality in (1611 1. and also in (|62] | we explored the Gumbel-type limiting distribution 
function as shown in (|44] |. which has a mean T = 0.5772 • • •, corresponding to the Euler-Mascheroni constant. 
Moreover, we exploited (|46] | and ( |49] l in ( |63] |. 

Remark 2: In comparison with the orthogonal counterpart with N — Nt, we find from ( [29b that the received 
SINR in our proposed scheme with Nt < N < is greatly decreased due to increased multi-user interferences 
as well as their mutual non-orthogonality, which will dramatically deteriorate the system throughput. However, this 
deterioration will be compensated by the increased spatial multiplexing gain A^, as shown in ( 1631 ). Anyway, the 
most important characteristic of our proposal is able to serve as large as Nf users simultaneously, which benefits to 
significantly decrease the scheduling latency. Moreover, we ignored the minimum data-rate requirement of each user 
in this paper. If we take it into account, the number of simultaneously transmitted users will possibly be decreased. 



A. The Effectiveness of Closed-form Upper Bound 

In this section, we first show the accuracy of our closed-form upper bound (|63] | in comparison with its asymptotic 
counterpart ( |56] | as well as the Monte-Carlo simulation results. In our simulations, the minimum number of users 
is 16, since there are at most 16 transmit beams if Nt — 4. On the other hand, the maximum number of users is 
set to be 2048. Although there will not be so many users in practical cellular communication systems, we are able 
to confirm the validity of our throughput analysis by comparing the numerical results with the simulation ones for 
such a large number of users. 

In Fig. m we show the system throughput of OBS with Grassmannian-based beamforming matrix as shown in 
(I20I 1. where Nt = 3, N — 7, and m — 0.5. We observe that the closed-form upper bound (|63] | almost always 
overlaps with the asymptotic (l56l l, no matter how many users there are or whatever SNR is or 5 dB. However, 
although we claimed in Section IIV-CI that all the scheduled users have almost the same maximum SINR as the 
number of users approaches infinity, there is always a very small gap between the simulation results and the upper 
bound as shown in Figs. |4]and|5] For example, when m = 0.5, p = OdB and the number of users K = 64, it is 
observed from the left-hand panel of Fig. |4] that the difference between ( 1631 ) and the simulations results is about 
0.06 bit/s/Hz, or 1.6% in relative to the simulation result 3.93 bit/s/Hz. Moreover, this gap becomes smaller and 
smaller as the number of users or the average SNR increases. The same observation can be attained from Fig. |5] 
where the beamforming matrix is based on MUB construction as shown in ( |27] |. Nt — A, N — 16, and m — 3. 



V. Simulation Results and Discussion 
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Fig. 4. The system throughput of OBS with Nt = 3, N = 7, and Grassmannian-based beamforming; m = 0.5. 




Fig. 5. The system throughput of OBS with Nf = 4, N = 16, and MUB-based beamforming; m = 3. 
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Fig. 6. The system throughput of OBS with Af = 4, 7, 9, 13, 16, and m = 0.5. 



Therefore, we conclude that our upper bound ( l63T l is very tight with simulation results, and thus it can be exploited 
below to evaluate the system throughput effectively. 

B. System Throughput of Proposed Schemes 

According to ( l63T l. we compare the system throughput of proposed schemes in Figs. |6] and El where the 
beamforming construction of = 4, 7, 13 are Grassmannian based, = 9 is Fourier based, and = 16 is 
MUB based, respectively. We find that the OBS with Grassmannian-based beamforming achieves the maximum 
system throughput whenever iVt = 3 or 4, corresponding to = 7 or 13, respectively. But if we want to fully 
exploit the degrees of freedom when A't — 3, 4, then we have to rely on the Fourier and MUB-based construction, 
that is, A^ = 9, 16 users can be simultaneously transmitted, respectively. Unfortunately, the increase of the number 
of simultaneously scheduled users is at the penalty of system throughput. For example, when K — 64 and p = dB, 
we observe from the left-hand panel of Fig. |6] that the throughput difference between the cases with N = 7 and 
A^ = 9 is about 0.19 bit/s/Hz, or 4.8% throughput loss of the case with A^ = 9 in relative to the throughput 
3.99bit/s/Hz of the case with N ^ 7. Moreover, this throughput loss will slightly increase as the number of 
users or the average SNR increases, but it decreases fast as the the variance 1/m of Rayleigh fading decreases 
by comparing Fig. |6] with Fig. |7] Furthermore, we observe that the system throughput degrades with decreasing 
variance 1/m by comparing Fig. |6] with Fig.]?] This degradation should not come as a surprise since the multi-user 
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m=3, p=OdB m=3, p=5dB 




# users (K) # users (K) 



Fig. 7. The system throughput of OBS with Af = 4, 7, 9, 13, 16, and m = 3. 

diversity gain will be smaller and smaller as the channel fading becomes more and more stable [29]. 

C. The Preferred Low SNR Case 

We point out that the proposed schemes with Nt < N < transmit beams is much beneficial to the low SNR 
case over the high SNR scenario. In Fig. |8] we see that the system throughput increases very slowly as the number 
of users increases sharply from 16 to 2048, where m = 1 and p = 10 dB. That is, when the SNR is high enough, 
the multi-user diversity gain becomes saturated soon. This phenomena can be understood as follows: We see from 
(|30] | that the received SINR 7^ ^ can be approximated to 1/(5^ if p is large enough, that is, the value of 7^^ ^ is 
independent of the user index k and therefore the multi-user diversity gain vanishes. 

D. Throughput Comparison With Orthogonal Counterpart 

Although our proposed schemes can schedule much more users than the number of transmit antennas Nt, how 
about the system throughput in comparison with their conventional orthogonal counterparts in which the number 
of transmit beams N equals Ntl In Figs. |9] and [TOl we show their system throughput comparison. Usually, in 
each cell of a practical cellular communication system, there are only tens of simultaneously active users. In this 
regard, we can see from the right-hand panel of Fig. |9] that the system throughput of our proposed scheme with 
A^t = 4, = 13 and its orthogonal counterpart with A^t = 4, = 4 are 6.06 and 7.31bit/s/Hz, respectively, if 
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m=1, p=10dB 

7i : 




10^ 10^ 10^ 

# users (K) 



Fig. 8. The system throughput of OBS with AT = 4, 7, 9, 13, 16, m = 1, and SNR = 10 dB. 

m = 0.5 and K — 128. In other words, there is 18% throughput loss but the scheduled users is of 225% increase! 
Furthermore, when the channel becomes more and more flat (as m increases), the throughput loss turns to be smaller 
and smaller, and even if to = 3 and K < 128 as shown in Fig. [TOl the system throughput of our proposed scheme 
with A'^t = 4, = 13 outperforms that of its orthogonal counterpart as well as any other cases with Nt < 4. The 
underlying reason is that, as the number of active users is small, for example, in any practical cellular system, the 
multi-user diversity gain is strictly limited and thus larger spatial multiplexing gain of our scheme leads to larger 
system throughput. Therefore, the proposed schemes, especially, the case with Nt — 4, N ^ 13, is of great interest 
in practical employment. 

Remark 3: Actually, the proposed schemes can be generalized to the cases with the number of receive antennas 
Nr > 1. Although he/she has the potential to use up to degrees of freedom, we can employ a combining 
strategy to reduce effectively each user with A',. > 1 to a single-dimensional receive terminal [34], [50]. That is, 
the rank of received signal of each user is forced to be 1, and thus the number of simultaneously transmitted users 
remains all the same. 

VI. Conclusion 

Inspired by the degrees of the freedom in the downlink of MISO systems, we demonstrated how to 
transmit to more than A^f users simultaneously, whereas at most A^f users can be simultaneously scheduled in 
the conventional MISO beamforming systems. We proposed three different opportunistic beamforming schemes: 
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Fig. 9. Throughput comparison between proposed scheme and their orthogonal counterpart, m = 0.5. 




Fig. 10. Throughput comparison between proposed scheme and their orthogonal counterpart, m = 3. 
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Fourier, Grassmannian, and MUB-based constructions. The Grassmannian-based scheme achieves the maximum 
system throughput with the number of transmit beams N = 4, 7, 13 in the cases with Nt = 2, 3, 4, respectively, by 
taking the optimal Grassmannian frames as the beamforming matrices. However, it can not exploit all degrees 
of freedom when Nt > 2. On the other hand, if we want to fully exploit 9 and 16 degrees of freedom in the cases 
with Nt = 3 and 4, we may resort to the Fourier and MUB-based schemes, respectively, despite a little penalty 
on system throughput. Finally, the special Grassmannian-based case with Nt — 4: and iV = 13 was shown to be 
promising for practical employment in cellular systems, since it outperforms its orthogonal counterpart in terms of 
the number of simultaneously scheduled users but without any throughput loss. 
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